Towards the Generation of Overspecified Multimodal Referring Expressions

نویسندگان

  • Ielka van der Sluis
  • Emiel Krahmer
چکیده

Recently, there has been an increased interest in the generation of multimodal referring expressions (e.g., Salmon-Alt and Romary, 2000; Lester et al., 1999; André and Rist, 1996; Reithinger, 1992; Claassen, 1992). The task involved in generation of multimodal referring expressions is to decide what is the best way to refer to a target via different modalities in the current context. Existing algorithms that generate referring expressions focus on distinguishing references (e.g., descriptions that only apply to the referent and not to any other object in the domain) or minimal references (e.g., the shortest distinguishing descriptions possible for a given referent). However in human conversation overspecified references (e.g., descriptions which include more information than necessary for identification) are relatively more common (Arts, 2004; Beun and Cremers, 1998; Pechmann, 1989). But in fact, what kind overspecification do human speakers produce and why? And secondly, how can this be mimicked in automatic generation? This paper attempts to answer these questions by: (1) addressing overspecification as occurring in human communication with respect to two production experiments used for the evaluation of the graph-based multimodal algorithm proposed by Krahmer and van der Sluis (2003) and (2) proposing a variant of this multimodal algorithm that can generate overspecified descriptions based on strategies observed in human communication. The basic multimodal algorithm, an extension of the graphbased algorithm (Krahmer et al., 2003), differs from other multimodal algorithms in that it can generate various pointing gestures differing in precision as a function of distance, where the type of pointing gesture co-varies with the amount of linguistic information in the accompanying description. The algorithm employs a set of cost functions to select properties, relations and pointing gestures according to preference. In this paper a notion of certainty is proposed that together with the cost functions causes a flexibility with which the algorithm can generate the whole range of possible referring expressions, from minimal ones to the utmost overspecified ones. This paper is organized as follows: In Section 2 overspecification in human communication is discussed. Section 3, presents a variant of the multimodal graph-based algorithm, that is able to generate unimodal and multimodal overspecified referring expressions. Section 4 ends this paper with a discussion.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Generating overspecified referring expressions: the role of discrimination

We present an experiment to compare a standard, minimally distinguishing algorithm for the generation of relational referring expressions with two alternatives that produce overspecified descriptions. The experiment shows that discrimination which normally plays a major role in the disambiguation task is also a major influence in referential overspecification, even though disambiguation is in p...

متن کامل

Towards a Balanced Corpus of Multimodal Referring Expressions in Dialogue

This paper describes an experiment in which dialogues are elicited through an identification task. Currently we are transcribing the collected data. The primary purpose of the experiment is to test a number of hypotheses regarding both the production and perception of multimodal referring expressions. To achieve this, the experiment was designed such that a number of factors (prior reference, f...

متن کامل

Generating Expressions that Refer to Visible Objects

We introduce a novel algorithm for generating referring expressions, informed by human and computer vision and designed to refer to visible objects. Our method separates absolute properties like color from relative properties like size to stochastically generate a diverse set of outputs. Expressions generated using this method are often overspecified and may be underspecified, akin to expressio...

متن کامل

Towards Generation Of Fluent Referring Action In Multimodal Situations

Referring actions in multimodal situations can be thought of as linguistic expressions well coordinated with several physical actions. In this paper, what patterns of linguistic expressions are commonly used and how physical actions are temporally coordinated to them are reported based on corpus examinations. In particular, by categorizing objects according to two features, visibility and membe...

متن کامل

Generating Multimodal References 1 Generating Multimodal References Generating Multimodal References 2

This paper presents a new computational model for the generation of multimodal referring expressions, based on observations in human communication. The algorithm is an extension of the graph-based algorithm proposed by Krahmer et al. (2003) and makes use of a so-called Flashlight Model for pointing. The Flashlight Model accounts for various types of pointing gestures of different precisions. Ba...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005